Overview
Brought to you by YData
Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 699 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 9 |
| Duplicate rows (%) | 1.3% |
| Total size in memory | 60.2 KiB |
| Average record size in memory | 88.2 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 1 |
| Dataset has 9 (1.3%) duplicate rows | Duplicates |
Bare_Nuclei is highly overall correlated with Bland_Chromatin and 7 other fields | High correlation |
Bland_Chromatin is highly overall correlated with Bare_Nuclei and 7 other fields | High correlation |
Class is highly overall correlated with Bare_Nuclei and 8 other fields | High correlation |
Clump_Thickness is highly overall correlated with Bare_Nuclei and 7 other fields | High correlation |
Marginal_Adhesion is highly overall correlated with Bare_Nuclei and 7 other fields | High correlation |
Mitoses is highly overall correlated with Class and 2 other fields | High correlation |
Normal_Nucleoli is highly overall correlated with Bare_Nuclei and 8 other fields | High correlation |
Single_Epithelial_Cell_Size is highly overall correlated with Bare_Nuclei and 7 other fields | High correlation |
Uniformity_of_Cell_Shape is highly overall correlated with Bare_Nuclei and 7 other fields | High correlation |
Uniformity_of_Cell_Size is highly overall correlated with Bare_Nuclei and 8 other fields | High correlation |
Reproduction
| Analysis started | 2024-11-17 13:25:27.438563 |
|---|---|
| Analysis finished | 2024-11-17 13:25:33.612205 |
| Duration | 6.17 seconds |
| Software version | ydata-profiling vv4.12.0 |
| Download configuration | config.json |
Variables
Sample_Code_Number
Real number (ℝ)
| Distinct | 645 |
|---|---|
| Distinct (%) | 92.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1071704.1 |
| Minimum | 61634 |
|---|---|
| Maximum | 13454352 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 61634 |
|---|---|
| 5-th percentile | 411453 |
| Q1 | 870688.5 |
| median | 1171710 |
| Q3 | 1238298 |
| 95-th percentile | 1333890.8 |
| Maximum | 13454352 |
| Range | 13392718 |
| Interquartile range (IQR) | 367609.5 |
Descriptive statistics
| Standard deviation | 617095.73 |
|---|---|
| Coefficient of variation (CV) | 0.57580794 |
| Kurtosis | 257.71716 |
| Mean | 1071704.1 |
| Median Absolute Deviation (MAD) | 104381 |
| Skewness | 13.675326 |
| Sum | 7.4912116 × 108 |
| Variance | 3.8080714 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1182404 | 6 | 0.9% |
| 1276091 | 5 | 0.7% |
| 1198641 | 3 | 0.4% |
| 897471 | 2 | 0.3% |
| 1116192 | 2 | 0.3% |
| 385103 | 2 | 0.3% |
| 411453 | 2 | 0.3% |
| 1293439 | 2 | 0.3% |
| 1143978 | 2 | 0.3% |
| 560680 | 2 | 0.3% |
| Other values (635) | 671 |
| Value | Count | Frequency (%) |
| 61634 | 1 | |
| 63375 | 1 | |
| 76389 | 1 | |
| 95719 | 1 | |
| 128059 | 1 | |
| 142932 | 1 | |
| 144888 | 1 | |
| 145447 | 1 | |
| 160296 | 1 | |
| 167528 | 1 |
| Value | Count | Frequency (%) |
| 13454352 | 1 | |
| 8233704 | 1 | |
| 1371920 | 1 | |
| 1371026 | 1 | |
| 1369821 | 1 | |
| 1368882 | 1 | |
| 1368273 | 1 | |
| 1368267 | 1 | |
| 1365328 | 1 | |
| 1365075 | 1 |
Clump_Thickness
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4177396 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.8157407 |
|---|---|
| Coefficient of variation (CV) | 0.63737135 |
| Kurtosis | -0.62371541 |
| Mean | 4.4177396 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.59285853 |
| Sum | 3088 |
| Variance | 7.9283955 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 145 | |
| 5 | 130 | |
| 3 | 108 | |
| 4 | 80 | |
| 10 | 69 | |
| 2 | 50 | 7.2% |
| 8 | 46 | 6.6% |
| 6 | 34 | 4.9% |
| 7 | 23 | 3.3% |
| 9 | 14 | 2.0% |
| Value | Count | Frequency (%) |
| 1 | 145 | |
| 2 | 50 | 7.2% |
| 3 | 108 | |
| 4 | 80 | |
| 5 | 130 | |
| 6 | 34 | 4.9% |
| 7 | 23 | 3.3% |
| 8 | 46 | 6.6% |
| 9 | 14 | 2.0% |
| 10 | 69 |
| Value | Count | Frequency (%) |
| 10 | 69 | |
| 9 | 14 | 2.0% |
| 8 | 46 | 6.6% |
| 7 | 23 | 3.3% |
| 6 | 34 | 4.9% |
| 5 | 130 | |
| 4 | 80 | |
| 3 | 108 | |
| 2 | 50 | 7.2% |
| 1 | 145 |
Uniformity_of_Cell_Size
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.1344778 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.0514591 |
|---|---|
| Coefficient of variation (CV) | 0.97351434 |
| Kurtosis | 0.098802885 |
| Mean | 3.1344778 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.2331366 |
| Sum | 2191 |
| Variance | 9.3114027 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 384 | |
| 10 | 67 | 9.6% |
| 3 | 52 | 7.4% |
| 2 | 45 | 6.4% |
| 4 | 40 | 5.7% |
| 5 | 30 | 4.3% |
| 8 | 29 | 4.1% |
| 6 | 27 | 3.9% |
| 7 | 19 | 2.7% |
| 9 | 6 | 0.9% |
| Value | Count | Frequency (%) |
| 1 | 384 | |
| 2 | 45 | 6.4% |
| 3 | 52 | 7.4% |
| 4 | 40 | 5.7% |
| 5 | 30 | 4.3% |
| 6 | 27 | 3.9% |
| 7 | 19 | 2.7% |
| 8 | 29 | 4.1% |
| 9 | 6 | 0.9% |
| 10 | 67 | 9.6% |
| Value | Count | Frequency (%) |
| 10 | 67 | 9.6% |
| 9 | 6 | 0.9% |
| 8 | 29 | 4.1% |
| 7 | 19 | 2.7% |
| 6 | 27 | 3.9% |
| 5 | 30 | 4.3% |
| 4 | 40 | 5.7% |
| 3 | 52 | 7.4% |
| 2 | 45 | 6.4% |
| 1 | 384 |
Uniformity_of_Cell_Shape
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2074392 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.9719128 |
|---|---|
| Coefficient of variation (CV) | 0.9265687 |
| Kurtosis | 0.00701098 |
| Mean | 3.2074392 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.1618592 |
| Sum | 2242 |
| Variance | 8.8322655 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 353 | |
| 2 | 59 | 8.4% |
| 10 | 58 | 8.3% |
| 3 | 56 | 8.0% |
| 4 | 44 | 6.3% |
| 5 | 34 | 4.9% |
| 6 | 30 | 4.3% |
| 7 | 30 | 4.3% |
| 8 | 28 | 4.0% |
| 9 | 7 | 1.0% |
| Value | Count | Frequency (%) |
| 1 | 353 | |
| 2 | 59 | 8.4% |
| 3 | 56 | 8.0% |
| 4 | 44 | 6.3% |
| 5 | 34 | 4.9% |
| 6 | 30 | 4.3% |
| 7 | 30 | 4.3% |
| 8 | 28 | 4.0% |
| 9 | 7 | 1.0% |
| 10 | 58 | 8.3% |
| Value | Count | Frequency (%) |
| 10 | 58 | 8.3% |
| 9 | 7 | 1.0% |
| 8 | 28 | 4.0% |
| 7 | 30 | 4.3% |
| 6 | 30 | 4.3% |
| 5 | 34 | 4.9% |
| 4 | 44 | 6.3% |
| 3 | 56 | 8.0% |
| 2 | 59 | 8.4% |
| 1 | 353 |
Marginal_Adhesion
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.806867 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.8553792 |
|---|---|
| Coefficient of variation (CV) | 1.0172834 |
| Kurtosis | 0.98794707 |
| Mean | 2.806867 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.5244681 |
| Sum | 1962 |
| Variance | 8.1531906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 407 | |
| 3 | 58 | 8.3% |
| 2 | 58 | 8.3% |
| 10 | 55 | 7.9% |
| 4 | 33 | 4.7% |
| 8 | 25 | 3.6% |
| 5 | 23 | 3.3% |
| 6 | 22 | 3.1% |
| 7 | 13 | 1.9% |
| 9 | 5 | 0.7% |
| Value | Count | Frequency (%) |
| 1 | 407 | |
| 2 | 58 | 8.3% |
| 3 | 58 | 8.3% |
| 4 | 33 | 4.7% |
| 5 | 23 | 3.3% |
| 6 | 22 | 3.1% |
| 7 | 13 | 1.9% |
| 8 | 25 | 3.6% |
| 9 | 5 | 0.7% |
| 10 | 55 | 7.9% |
| Value | Count | Frequency (%) |
| 10 | 55 | 7.9% |
| 9 | 5 | 0.7% |
| 8 | 25 | 3.6% |
| 7 | 13 | 1.9% |
| 6 | 22 | 3.1% |
| 5 | 23 | 3.3% |
| 4 | 33 | 4.7% |
| 3 | 58 | 8.3% |
| 2 | 58 | 8.3% |
| 1 | 407 |
Single_Epithelial_Cell_Size
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2160229 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.2142999 |
|---|---|
| Coefficient of variation (CV) | 0.68852118 |
| Kurtosis | 2.1690664 |
| Mean | 3.2160229 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.7121718 |
| Sum | 2248 |
| Variance | 4.903124 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 386 | |
| 3 | 72 | 10.3% |
| 4 | 48 | 6.9% |
| 1 | 47 | 6.7% |
| 6 | 41 | 5.9% |
| 5 | 39 | 5.6% |
| 10 | 31 | 4.4% |
| 8 | 21 | 3.0% |
| 7 | 12 | 1.7% |
| 9 | 2 | 0.3% |
| Value | Count | Frequency (%) |
| 1 | 47 | 6.7% |
| 2 | 386 | |
| 3 | 72 | 10.3% |
| 4 | 48 | 6.9% |
| 5 | 39 | 5.6% |
| 6 | 41 | 5.9% |
| 7 | 12 | 1.7% |
| 8 | 21 | 3.0% |
| 9 | 2 | 0.3% |
| 10 | 31 | 4.4% |
| Value | Count | Frequency (%) |
| 10 | 31 | 4.4% |
| 9 | 2 | 0.3% |
| 8 | 21 | 3.0% |
| 7 | 12 | 1.7% |
| 6 | 41 | 5.9% |
| 5 | 39 | 5.6% |
| 4 | 48 | 6.9% |
| 3 | 72 | 10.3% |
| 2 | 386 | |
| 1 | 47 | 6.7% |
Bare_Nuclei
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4864092 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.6219288 |
|---|---|
| Coefficient of variation (CV) | 1.0388708 |
| Kurtosis | -0.72646662 |
| Mean | 3.4864092 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.0253473 |
| Sum | 2437 |
| Variance | 13.118368 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 418 | |
| 10 | 132 | 18.9% |
| 2 | 30 | 4.3% |
| 5 | 30 | 4.3% |
| 3 | 28 | 4.0% |
| 8 | 21 | 3.0% |
| 4 | 19 | 2.7% |
| 9 | 9 | 1.3% |
| 7 | 8 | 1.1% |
| 6 | 4 | 0.6% |
| Value | Count | Frequency (%) |
| 1 | 418 | |
| 2 | 30 | 4.3% |
| 3 | 28 | 4.0% |
| 4 | 19 | 2.7% |
| 5 | 30 | 4.3% |
| 6 | 4 | 0.6% |
| 7 | 8 | 1.1% |
| 8 | 21 | 3.0% |
| 9 | 9 | 1.3% |
| 10 | 132 | 18.9% |
| Value | Count | Frequency (%) |
| 10 | 132 | 18.9% |
| 9 | 9 | 1.3% |
| 8 | 21 | 3.0% |
| 7 | 8 | 1.1% |
| 6 | 4 | 0.6% |
| 5 | 30 | 4.3% |
| 4 | 19 | 2.7% |
| 3 | 28 | 4.0% |
| 2 | 30 | 4.3% |
| 1 | 418 |
Bland_Chromatin
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4377682 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.4383643 |
|---|---|
| Coefficient of variation (CV) | 0.70928698 |
| Kurtosis | 0.18462131 |
| Mean | 3.4377682 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.0999691 |
| Sum | 2403 |
| Variance | 5.9456202 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 166 | |
| 3 | 165 | |
| 1 | 152 | |
| 7 | 73 | |
| 4 | 40 | 5.7% |
| 5 | 34 | 4.9% |
| 8 | 28 | 4.0% |
| 10 | 20 | 2.9% |
| 9 | 11 | 1.6% |
| 6 | 10 | 1.4% |
| Value | Count | Frequency (%) |
| 1 | 152 | |
| 2 | 166 | |
| 3 | 165 | |
| 4 | 40 | 5.7% |
| 5 | 34 | 4.9% |
| 6 | 10 | 1.4% |
| 7 | 73 | |
| 8 | 28 | 4.0% |
| 9 | 11 | 1.6% |
| 10 | 20 | 2.9% |
| Value | Count | Frequency (%) |
| 10 | 20 | 2.9% |
| 9 | 11 | 1.6% |
| 8 | 28 | 4.0% |
| 7 | 73 | |
| 6 | 10 | 1.4% |
| 5 | 34 | 4.9% |
| 4 | 40 | 5.7% |
| 3 | 165 | |
| 2 | 166 | |
| 1 | 152 |
Normal_Nucleoli
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8669528 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.0536339 |
|---|---|
| Coefficient of variation (CV) | 1.0651148 |
| Kurtosis | 0.47426868 |
| Mean | 2.8669528 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4222613 |
| Sum | 2004 |
| Variance | 9.32468 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 443 | |
| 10 | 61 | 8.7% |
| 3 | 44 | 6.3% |
| 2 | 36 | 5.2% |
| 8 | 24 | 3.4% |
| 6 | 22 | 3.1% |
| 5 | 19 | 2.7% |
| 4 | 18 | 2.6% |
| 7 | 16 | 2.3% |
| 9 | 16 | 2.3% |
| Value | Count | Frequency (%) |
| 1 | 443 | |
| 2 | 36 | 5.2% |
| 3 | 44 | 6.3% |
| 4 | 18 | 2.6% |
| 5 | 19 | 2.7% |
| 6 | 22 | 3.1% |
| 7 | 16 | 2.3% |
| 8 | 24 | 3.4% |
| 9 | 16 | 2.3% |
| 10 | 61 | 8.7% |
| Value | Count | Frequency (%) |
| 10 | 61 | 8.7% |
| 9 | 16 | 2.3% |
| 8 | 24 | 3.4% |
| 7 | 16 | 2.3% |
| 6 | 22 | 3.1% |
| 5 | 19 | 2.7% |
| 4 | 18 | 2.6% |
| 3 | 44 | 6.3% |
| 2 | 36 | 5.2% |
| 1 | 443 |
Mitoses
Real number (ℝ)
High correlation 
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5894134 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 5 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.7150779 |
|---|---|
| Coefficient of variation (CV) | 1.0790634 |
| Kurtosis | 12.657878 |
| Mean | 1.5894134 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.5606578 |
| Sum | 1111 |
| Variance | 2.9414923 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 579 | |
| 2 | 35 | 5.0% |
| 3 | 33 | 4.7% |
| 10 | 14 | 2.0% |
| 4 | 12 | 1.7% |
| 7 | 9 | 1.3% |
| 8 | 8 | 1.1% |
| 5 | 6 | 0.9% |
| 6 | 3 | 0.4% |
| Value | Count | Frequency (%) |
| 1 | 579 | |
| 2 | 35 | 5.0% |
| 3 | 33 | 4.7% |
| 4 | 12 | 1.7% |
| 5 | 6 | 0.9% |
| 6 | 3 | 0.4% |
| 7 | 9 | 1.3% |
| 8 | 8 | 1.1% |
| 10 | 14 | 2.0% |
| Value | Count | Frequency (%) |
| 10 | 14 | 2.0% |
| 8 | 8 | 1.1% |
| 7 | 9 | 1.3% |
| 6 | 3 | 0.4% |
| 5 | 6 | 0.9% |
| 4 | 12 | 1.7% |
| 3 | 33 | 4.7% |
| 2 | 35 | 5.0% |
| 1 | 579 |
Class
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.7 KiB |
| 2 | |
|---|---|
| 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 458 | |
| 4 | 241 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 458 | |
| 4 | 241 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 458 | |
| 4 | 241 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 699 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 458 | |
| 4 | 241 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 699 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 458 | |
| 4 | 241 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 699 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 458 | |
| 4 | 241 |
Interactions
Correlations
| Bare_Nuclei | Bland_Chromatin | Class | Clump_Thickness | Marginal_Adhesion | Mitoses | Normal_Nucleoli | Sample_Code_Number | Single_Epithelial_Cell_Size | Uniformity_of_Cell_Shape | Uniformity_of_Cell_Size | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Bare_Nuclei | 1.000 | 0.669 | 0.835 | 0.586 | 0.694 | 0.478 | 0.649 | -0.119 | 0.689 | 0.741 | 0.761 |
| Bland_Chromatin | 0.669 | 1.000 | 0.804 | 0.538 | 0.625 | 0.387 | 0.662 | -0.096 | 0.640 | 0.692 | 0.719 |
| Class | 0.835 | 0.804 | 1.000 | 0.738 | 0.738 | 0.519 | 0.768 | 0.000 | 0.791 | 0.860 | 0.875 |
| Clump_Thickness | 0.586 | 0.538 | 0.738 | 1.000 | 0.542 | 0.419 | 0.570 | -0.004 | 0.584 | 0.664 | 0.666 |
| Marginal_Adhesion | 0.694 | 0.625 | 0.738 | 0.542 | 1.000 | 0.447 | 0.634 | -0.050 | 0.668 | 0.712 | 0.743 |
| Mitoses | 0.478 | 0.387 | 0.519 | 0.419 | 0.447 | 1.000 | 0.504 | -0.075 | 0.480 | 0.473 | 0.509 |
| Normal_Nucleoli | 0.649 | 0.662 | 0.768 | 0.570 | 0.634 | 0.504 | 1.000 | -0.071 | 0.706 | 0.725 | 0.757 |
| Sample_Code_Number | -0.119 | -0.096 | 0.000 | -0.004 | -0.050 | -0.075 | -0.071 | 1.000 | -0.087 | -0.060 | -0.043 |
| Single_Epithelial_Cell_Size | 0.689 | 0.640 | 0.791 | 0.584 | 0.668 | 0.480 | 0.706 | -0.087 | 1.000 | 0.759 | 0.787 |
| Uniformity_of_Cell_Shape | 0.741 | 0.692 | 0.860 | 0.664 | 0.712 | 0.473 | 0.725 | -0.060 | 0.759 | 1.000 | 0.892 |
| Uniformity_of_Cell_Size | 0.761 | 0.719 | 0.875 | 0.666 | 0.743 | 0.509 | 0.757 | -0.043 | 0.787 | 0.892 | 1.000 |
Missing values
Sample
| Sample_Code_Number | Clump_Thickness | Uniformity_of_Cell_Size | Uniformity_of_Cell_Shape | Marginal_Adhesion | Single_Epithelial_Cell_Size | Bare_Nuclei | Bland_Chromatin | Normal_Nucleoli | Mitoses | Class | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1000025 | 5 | 1 | 1 | 1 | 2 | 1.0 | 3 | 1 | 1 | 2 |
| 1 | 1002945 | 5 | 4 | 4 | 5 | 7 | 10.0 | 3 | 2 | 1 | 2 |
| 2 | 1015425 | 3 | 1 | 1 | 1 | 2 | 2.0 | 3 | 1 | 1 | 2 |
| 3 | 1016277 | 6 | 8 | 8 | 1 | 3 | 4.0 | 3 | 7 | 1 | 2 |
| 4 | 1017023 | 4 | 1 | 1 | 3 | 2 | 1.0 | 3 | 1 | 1 | 2 |
| 5 | 1017122 | 8 | 10 | 10 | 8 | 7 | 10.0 | 9 | 7 | 1 | 4 |
| 6 | 1018099 | 1 | 1 | 1 | 1 | 2 | 10.0 | 3 | 1 | 1 | 2 |
| 7 | 1018561 | 2 | 1 | 2 | 1 | 2 | 1.0 | 3 | 1 | 1 | 2 |
| 8 | 1033078 | 2 | 1 | 1 | 1 | 2 | 1.0 | 1 | 1 | 5 | 2 |
| 9 | 1033078 | 4 | 2 | 1 | 1 | 2 | 1.0 | 2 | 1 | 1 | 2 |
| Sample_Code_Number | Clump_Thickness | Uniformity_of_Cell_Size | Uniformity_of_Cell_Shape | Marginal_Adhesion | Single_Epithelial_Cell_Size | Bare_Nuclei | Bland_Chromatin | Normal_Nucleoli | Mitoses | Class | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 689 | 654546 | 1 | 1 | 1 | 1 | 2 | 1.0 | 1 | 1 | 8 | 2 |
| 690 | 654546 | 1 | 1 | 1 | 3 | 2 | 1.0 | 1 | 1 | 1 | 2 |
| 691 | 695091 | 5 | 10 | 10 | 5 | 4 | 5.0 | 4 | 4 | 1 | 4 |
| 692 | 714039 | 3 | 1 | 1 | 1 | 2 | 1.0 | 1 | 1 | 1 | 2 |
| 693 | 763235 | 3 | 1 | 1 | 1 | 2 | 1.0 | 2 | 1 | 2 | 2 |
| 694 | 776715 | 3 | 1 | 1 | 1 | 3 | 2.0 | 1 | 1 | 1 | 2 |
| 695 | 841769 | 2 | 1 | 1 | 1 | 2 | 1.0 | 1 | 1 | 1 | 2 |
| 696 | 888820 | 5 | 10 | 10 | 3 | 7 | 3.0 | 8 | 10 | 2 | 4 |
| 697 | 897471 | 4 | 8 | 6 | 4 | 3 | 4.0 | 10 | 6 | 1 | 4 |
| 698 | 897471 | 4 | 8 | 8 | 5 | 4 | 5.0 | 10 | 4 | 1 | 4 |
Duplicate rows
Most frequently occurring
| Sample_Code_Number | Clump_Thickness | Uniformity_of_Cell_Size | Uniformity_of_Cell_Shape | Marginal_Adhesion | Single_Epithelial_Cell_Size | Bare_Nuclei | Bland_Chromatin | Normal_Nucleoli | Mitoses | Class | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 320675 | 3 | 3 | 5 | 2 | 3 | 10.0 | 7 | 1 | 1 | 4 | 2 |
| 1 | 466906 | 1 | 1 | 1 | 1 | 2 | 1.0 | 1 | 1 | 1 | 2 | 2 |
| 2 | 704097 | 1 | 1 | 1 | 1 | 1 | 1.0 | 2 | 1 | 1 | 2 | 2 |
| 3 | 733639 | 3 | 1 | 1 | 1 | 2 | 1.0 | 3 | 1 | 1 | 2 | 2 |
| 4 | 1100524 | 6 | 10 | 10 | 2 | 8 | 10.0 | 7 | 3 | 3 | 4 | 2 |
| 5 | 1116116 | 9 | 10 | 10 | 1 | 10 | 8.0 | 3 | 3 | 1 | 4 | 2 |
| 6 | 1198641 | 3 | 1 | 1 | 1 | 2 | 1.0 | 3 | 1 | 1 | 2 | 2 |
| 7 | 1218860 | 1 | 1 | 1 | 1 | 1 | 1.0 | 3 | 1 | 1 | 2 | 2 |
| 8 | 1321942 | 5 | 1 | 1 | 1 | 2 | 1.0 | 3 | 1 | 1 | 2 | 2 |